annakrystalli

dataviz.shef

“Our plans and hopes for the Data Visualisation Hub”


Anna Krystalli

Research Software Engineering, TUoS

7th June 2018

Background

Why data visualisation?


  • Data visualisation ➡️ an integral part of research communication
  • Exciting new open source 📊 📈 📉 technologies are revolutionising the potential for dataviz as an impactful research output.
  • Little support for training in best practice and tooling

gapminder.org: 2006

liberating stories from data

gapminder.org: today

liberating stories from data

gapmider at our fingertips

library(ggplot2)

p <- ggplot(gapminder::gapminder, aes(gdpPercap, lifeExp, size = pop, color = continent, frame = year)) +
  geom_point() + scale_x_log10() + theme_bw()
plotly::ggplotly(p)

with power comes responsibility

http://www.tylervigen.com/spurious-correlations

Calling Bulls__t

Data Reasoning in a Digital World: Syllabus


Week 6. Data visualization. Data graphics can be powerful tools for understanding information, but they can also be powerful tools for misleading audiences. We explore the many ways that data graphics can steer viewers toward misleading conclusions.

  • Edward Tufte (1983) The Visual Display of Quantitative Information Chapters 2 (Graphical integrity) and 5 (Chartjunk: vibrations, grids, and ducks).

  • Tools and tricks: Misleading axes

  • Tools and tricks: Proportional Ink

background: recap

  • Huge opportunities.
  • Upskilling might be required but worth it
  • Focus on Best Practice necessary.

Enter dataviz.shef

A joint initiative between:

  • The University Library

  • Corporate Information and Computing Services (CICS)

  • Research Software Engineering (RSE)


dataviz.shef goal:


Promote and build community around data visualisation at the University of Sheffield

The team

previous members


orda.shef.ac.uk


orda.shef.ac.uk/visualisations/

Visualisation Showcase goals


  • Help promote researchers’ reposited data
  • Promote data visualisation as impactful research outputs

Path to achieving them


Q: How do we help our researchers make the most of their data and modern data visualisation tools?

Path to achieving them


Q: How do we help our researchers make the most of their data and modern data visualisation tools?


Let’s ask them!

TUoS Data Visualisation Survey

Department

Position

Analysis Frameworks

Current Data Visualisation tools

Aspirational Data Visualisation tools

Experience

Main take aways

  • Still a lot of Excel use
  • R and Python top of researchers aspirations
  • Broad interest in a variety of more niche tools
  • Skill, expertise and level of interest are distributed

Proposed action

Develop capacity through community

via GIPHY

Develop community of practice


🙌 👩‍💻 🤝 👨‍🔬 👩‍🎓 🤝 👨‍🏫 👨‍💻 🤝 👩‍🔬 👨‍🎓 👩‍🏫 🙌 👯


Support through:

  • Documentation on dataviz best practice and use of tools
  • Communication channels
  • Capacity building opportunities through Events

Documentation

OxShef

Main home of best practice and tooling documentation

  • twitter: @oxshef_dataviz

Full credits for work so far to Martin Hadley of The University of Oxford IDN]

Chart type vs tool matrix

  • Comprehensive list of chart types
  • Resources for developing the same charts using different tools.
  • Initial focus on R and Python but open for expansion

dataviz.shef:

Data Visualisation Hub website

  • Information on community events and activities
  • Documentation specific to the ORDA Data Visualisation showcase

Open Source Documentation

  • All materials are CC-BY
  • All materials are open for contribution on GitHub
  • Got a suggestion 💡? Bug report? Feel free to open an issue.

Communication

Google group

  • low volume announcements

Slack community

  • free sharing
  • good for sharing useful links, blogs, tutorials, questions and chat.
  • say hello 👋 in the #welcome channel!

Data Visualisation Hub Blog

Events

Launch

Workshops

Instructor and/or peer to peer led.

Upcoming

  • Data Visualisation in R using ggplot2 (Jun-Jul)


Wishlist

  • Data Visualisation in Python using altair
  • Interactive Data Visualisation in Shiny (R) & Dash (Python)

Symposia

The Missing Link in Publishing: Interactive Data Visualisation 4th December 2017, University of Oxford

Hackathons

NBN Sheffield R Users Group hack:

Coding clubs

Meetups

Sheffield Data For Good

Bringing together the data and social expertise in Sheffield to help tackle the city’s social problems

Success is in the network

Get involved!

  • Share your data visualisations on ORDA showcase
  • Join the google group &/or slack community
  • Contribute to the blog
  • Contribute to the documentation
  • Get in touch with ideas for training, coding clubs, hackathons

dataviz.shef.ac.uk

  • a.krystalli[at]sheffield.ac.uk